Improving part-of-speech tagging in Amharic language using deep neural network
Abstract
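To date, several POS taggers have been introduced to facilitate the success of semantic analysis for different languages. However, the task of POS tagging becomes more intricate in morphologically complex languages, like Amharic. In this paper, we evaluated models such as bidirectional long short-term memory, a convolutional neural network in combination with bidirectional long short-term memory, and a conditional random field for Amharic POS tagging. Various features, both language-dependent and language-independent, were explored in the conditional random field model. Besides, word-level and character-level features are analyzed in the deep neural network models. A convolutional neural network is utilized for encoding features at the word and character level. Each model's performance has been evaluated on a dataset that contained 321K tokens manually tagged with 31 POS tags. Lastly, the best performance, obtained by an end-to-end deep neural network model (a convolutional neural network in combination with bidirectional long short-term memory and a conditional random field), is 97.23% accuracy. This is the highest accuracy for the Amharic POS tagging task and is competitive with contemporary taggers currently existing for other languages.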
Similar resources
Methods for Amharic Part-of-Speech Tagging
The paper describes a set of experiments involving the application of three state-of-the-art part-of-speech taggers to Ethiopian Amharic, using three different tagsets. The taggers showed worse performance than previously reported results for English, in particular having problems with unknown words. The best results were obtained using a Maximum Entropy approach, while HMM-based and SVM-based ta...
Part of Speech Tagging for Amharic using Conditional Random Fields
We applied Conditional Random Fields (CRFs) to the tasks of Amharic word segmentation and POS tagging using a small annotated corpus of 1000 words. Given the size of the data and the large number of unknown words in the test corpus (80%), an accuracy of 84% for Amharic word segmentation and 74% for POS tagging is encouraging, indicating the applicability of CRFs for a morphologically complex la...
A Neural Network Approach to Part-of-Speech Tagging*
Neural networks are one of the most efficient techniques for learning from scarce data. This property is very useful when trying to build a part-of-speech tagger. Available part-of-speech taggers need huge amounts of hand tagged text, but for Portuguese there is no such corpora available. In this paper we propose a neural network that, apparently, is capable of overcoming the huge training corp...
Neural Network Approach to Thai Part Of Speech Tagging
Thai part of speech (POS) tagging is a challenging problem in natural language processing. Many techniques, including artificial neural network techniques, have been suggested for POS tagging. Research on Thai POS tagging has so far focused only on assigning word types, but not word features. This paper proposed a technique using a multilayer perceptron for tagging word features in Thai sentences. The f...
Amharic Part-of-Speech Tagger for Factored Language Modeling
This paper presents Amharic part of speech taggers developed for factored language modeling. Hidden Markov Model (HMM) and Support Vector Machine (SVM) based taggers have been trained using TnT and SVMTool. The overall accuracy of the best performing TnT- and SVM-based taggers is 82.99% and 85.50%, respectively. Generally, with respect to accuracy SVM-based taggers perform better than TnTbase...
Journal
Journal title: Heliyon
Year: 2023
ISSN: 2405-8440
DOI: https://doi.org/10.1016/j.heliyon.2023.e17175